Native Language Identification Across Text Types: How Special Are Scientists?

نویسندگان

چکیده

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Native Language Identification on Text and Speech

This paper presents an ensemble system combining the output of multiple SVM classifiers to native language identification (NLI). The system was submitted to the NLI Shared Task 2017 fusion track which featured students essays and spoken responses in form of audio transcriptions and iVectors by non-native English speakers of eleven native languages. Our system competed in the challenge under the...

متن کامل

Generalization in Native Language Identification: Learners versus Scientists

English. Native Language Identification (NLI) is the task of recognizing an author’s native language from text in another language. In this paper, we consider three English learner corpora and one new, presumably more difficult, scientific corpus. We find that the scientific corpus is only about as hard to model as a less-controlled learner corpus, but cannot profit as much from corpus combinat...

متن کامل

(Non)native Language Teachers’ Cognitions: Are They Dichotomous?

In view of native/nonnative language teacher dichotomy, different characteristics have been assigned to these 2 groups. The dichotomy has been the source of different actions and measures to clarify the positive and negative points of being (non)native teachers. In recent years, many researchers have revisited this dichotomy. The challenge to the dichotomy can be a source of motivation to explo...

متن کامل

Parser evaluation across text types

When a statistical parser is trained on one treebank, one usually tests it on another portion of the same treebank, partly due to the fact that a comparable annotation format is needed for testing. But the user of a parser may not be interested in parsing sentences from the same newspaper all over, or even wants syntactic annotations for a slightly different text type. Gildea (2001) for instanc...

متن کامل

From Language to Family and Back: Native Language and Language Family Identification from English Text

Revealing an anonymous author’s traits from text is a well-researched area. In this paper we aim to identify the native language and language family of a non-native English author, given his/her English writings. We extract features from the text based on prior work, and extend or modify it to construct different feature sets, and use support vector machines for classification. We show that nat...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Italian Journal of Computational Linguistics

سال: 2016

ISSN: 2499-4553

DOI: 10.4000/ijcol.348